Add tests for AMDGPU by kshyatt · Pull Request #271 · QuantumKitHub/TensorOperations.jl

kshyatt · 2026-05-11T09:15:31Z

No description provided.

lkdvos · 2026-05-11T14:35:26Z

Thanks for setting this up! Would be cool if the tests actually pass, but since this is not really relevant for what this PR is trying to achieve I would then still merge anyways.

kshyatt · 2026-05-11T14:38:54Z

AMD CI is really backed up but we can just see what happens when it finally clears, I guess

lkdvos · 2026-05-11T17:07:23Z

Ah, it turns out we need proper overloads for the allocator as well. I give this a quick try but I don't have access to any AMD machines so I can't actually try this out.

kshyatt · 2026-05-11T17:49:52Z

I do have access so I'll give it a whirl tomorrow

lkdvos · 2026-06-11T11:28:35Z

@kshyatt it seems like the AMD tests are still not all passing. Do you want me to wait with tagging a release to include this, or should I just go ahead?

kshyatt · 2026-06-11T11:34:31Z

I'll take a look later today and if I can't sort it out let's just tag?

kshyatt · 2026-06-15T11:48:54Z

This worked on 1.10 and 1.12 on AMD for me, let's see how CUDA does

lkdvos · 2026-06-15T11:57:32Z

    # resolve conj flags and absorb into StridedView constructor to avoid type instabilities later on
    if conjA
-        stridedtensoradd!(SV(C), conj(SV(A)), pA, α, β, backend, allocator)
+        stridedtensoradd!(SV(C), conj!(SV(A)), pA, α, β, backend, allocator)


Do we need this conj! there? It looks a bit suspicious to me, since I'm not sure we are really allowed to modify A, so we are really depending on the fact that conj(SV(A)) produces a view without modifying A.

This "resolved" the AMD tests but caused the CUDA ones to fail (I was testing on an AMD machine), I thought conj! on a StridedView would also return simply a view but I guess not

kshyatt · 2026-06-15T11:57:48Z

Oh my god it's a difference between conj and conj! on 1.10 I hate being able to read

kshyatt · 2026-06-15T12:31:17Z

I think the problem for AMD on 1.10 is somewhere in https://github.com/QuantumKitHub/Strided.jl/blob/main/ext/StridedGPUArraysExt.jl#L129 I'll keep digging a bit

kshyatt · 2026-06-15T13:08:30Z

OK, this is I think a problem in that kernel only on AMD and 1.10. Annoyingly you can't print from within a kernel on AMD super easily so I would say let's revert my last commit, merge this, and I'll keep working on the problem in Strided

This reverts commit 5502697.

lkdvos · 2026-06-15T13:23:05Z

I'm not entirely following since I don't really know what error you are getting (correctness or actual runtime errors?), but I'm definitely okay with opening an issue in Strided for this, as it definitely seems like that is the problem, and not the specific TensorOperations implementation.

kshyatt · 2026-06-15T13:24:11Z

It's correctness. The output of that kernel for the same inputs differs for AMD between 1.10 and 1.12

kshyatt requested a review from lkdvos May 11, 2026 09:15

kshyatt force-pushed the ksh/amd branch from 289cb64 to a228b85 Compare May 11, 2026 09:17

lkdvos previously approved these changes May 11, 2026

View reviewed changes

kshyatt enabled auto-merge (squash) May 11, 2026 14:38

lkdvos dismissed their stale review via f10e81a May 11, 2026 17:06

kshyatt force-pushed the ksh/amd branch from d7fe30a to 6b4fef9 Compare June 5, 2026 18:19

lkdvos previously approved these changes Jun 11, 2026

View reviewed changes

kshyatt and others added 4 commits June 11, 2026 07:28

Add tests for AMDGPU

8c8dc41

Add AMDGPU allocator support

1364d01

Get AMDGPU tensor ops working

3b343ae

Update Project.toml

2ee27e9

lkdvos force-pushed the ksh/amd branch from 05a335c to 2ee27e9 Compare June 11, 2026 11:28

Update pipeline.yml

9e3569f

kshyatt dismissed lkdvos’s stale review via 9e3569f June 12, 2026 11:33

kshyatt and others added 2 commits June 12, 2026 13:45

Merge branch 'master' into ksh/amd

c906f14

Fix AMD on 1.10

5502697

lkdvos reviewed Jun 15, 2026

View reviewed changes

Revert "Fix AMD on 1.10"

fa13d4d

This reverts commit 5502697.

lkdvos disabled auto-merge June 15, 2026 14:54

lkdvos merged commit 574b16d into master Jun 15, 2026
11 of 12 checks passed

lkdvos deleted the ksh/amd branch June 15, 2026 14:54

lkdvos mentioned this pull request Jun 15, 2026

Correctness error in AMD kernel with conj QuantumKitHub/Strided.jl#63

Open

Conversation

kshyatt commented May 11, 2026

Uh oh!

lkdvos commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kshyatt commented May 11, 2026

Uh oh!

lkdvos commented May 11, 2026

Uh oh!

kshyatt commented May 11, 2026

Uh oh!

lkdvos commented Jun 11, 2026

Uh oh!

kshyatt commented Jun 11, 2026

Uh oh!

kshyatt commented Jun 15, 2026

Uh oh!

lkdvos Jun 15, 2026

Choose a reason for hiding this comment

Uh oh!

kshyatt Jun 15, 2026

Choose a reason for hiding this comment

Uh oh!

kshyatt commented Jun 15, 2026

Uh oh!

kshyatt commented Jun 15, 2026

Uh oh!

kshyatt commented Jun 15, 2026

Uh oh!

lkdvos commented Jun 15, 2026

Uh oh!

kshyatt commented Jun 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lkdvos commented May 11, 2026 •

edited

Loading